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Abstract. Quasispecies theory predicts that there is a critical mutation probabiUty above which a viral 
population will go extinct. Above this threshold the virus loses the ability to replicate the best adapted 
genotype, leading to a population composed of low replicating mutants that is eventually doomed. We 
propose a new branching model that shows that this is not necessarily so. That is, a population composed 
of ever changing mutants may survive. 
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1 Introduction. 

Compared to other species an RNA virus has a very high mutation rate and a great deal of genomic diversity. 
Hence, a virus population can be thought of as an ensemble of related genotypes called quasispecies, see Eigen 
(1971) and Eigen and Schuster (1977). From the virus point of view a high mutation rate is advantageous 
because it may create rather diverse virus genomes, this may overwhelm the immune system of the host 
and ensure survival of the virus population, see Vignuzzi et al. (2006). On the other hand, a high mutation 
rate may result in many nonviable individuals and hurt the quasispecies, see Sanjuan et al. (2004) and 
Elena and Moya (1999). It seems therefore that mutation rates should be high but not too high. A simple 
mathematical model makes this point. Consider a virus population having genomes 1 and 2, where genome 
1 has a higher replication rate oi and genome 2 has a lower replication rate a2- We suppose that when type 
1 individuals replicate, the new individual has a type 1 genome with probability 1 — r and a type 2 genome 
with probability r. Type 2 genome individuals do not mutate. The model is then 



where Vi is the number of type i genomes for i = 1, 2. This is a variation of a model in Section 8.5 of Nowak 
and May (2000). A slightly different but perhaps better interpretation of this model is to think of genome 1 
as being a specific (high performing) genome and genome 2 as the collection of all the other genomes in the 
population. 

This system of differential equations is easily solved, and one can check that the ratio Vi/v2 converges as 
t goes to infinity. It turns out that the limit is strictly positive if and only if r < 1 — a^jai. That is, in order 
for type 1 to be maintained in the population the mutation r needs to be below the threshold 1 — a2/ai. 
Hence, this model predicts that above a certain mutation threshold faithful replication of the best adapted 
genotype is compromised. Moreover, there seems to be general agreement in the biology literature that above 
this threshold the virus population will go extinct, see Eigen (2002) and Manrubia et al. (2010). We propose 
here a simple stochastic model that shows that this is not necessarily so. In our model the population may 
survive, even if faithful replication of the best adapted genotype is compromised, with the population being 
composed of ever changing mutants. 
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Our results may be biologically relevant for the following reason. An important current strategy to fight 
HIV and other viruses is to try to increase the mutation probability of the virus, see Eigen (2002) and 
Manrubia et al (2010). This assumes that above a certain mutation threshold the virus will die out. Our 
model suggests that at least in theory this strategy may not work. 

We now describe our continuous time evolution process. Let be a probability distribution with support 
contained in [0, oo) and which is absolutely continuous with respect to Lebesgue measure, and let r G [0, 1]. 
Start with one individual at time 0, and sample a birth rate A from the distribution /i. The individual gives 
birth at rate A and dies at rate 1. Every time there is a birth the new individual: (i) with probability 1 — r 
keeps the same birth rate A as its parent, and (ii) with probability r is given a new birth rate A', sampled 
independently of everything else from the distribution /.t. We think of r as the mutation probability and the 
birth rate of an individual as representing the fitness or genotype of the individual. Since /i is assumed to be 
continuous, a genotype cannot appear more than once in the evolution of the population. For convenience 
we label the genotypes in the order of their appearance. 

Let Z{t) denote the number of individuals alive at time t. We say that the evolution process survives if 
Z{t) > V t > and otherwise dies out. Our main interest is in determining whether survival with positive 
probability is possible and by what mechanism can survival be achieved. 

Theorem 1. For Q < r < 1 and probability distributions fi on [0,oo), the evolution process survives with 
positive probability if and only at least one of the following survival conditions holds: 

(I) M({A:A(l-r)>l})>0, 



(II) / M) > 1. 

J{A:A(l-r)<l} i - Al^l - r) 

The two extreme cases r = and r — 1 are easy to understand. If ?- = then (II) cannot hold and (I) 
reduces to fi{{l,oo)) > 0. In this case, no new types are ever produced, the initial branching rate is used 
forever by all individuals. Conditional on the initial branching rate A, Z{t) is a linear birth-death process 
which survives iff A > 1. Thus (I) is equivalent to positive probability of survival. When r — 1, (I) cannot 
hold and (II) reduces to / A(i^(A) > 1. Now each new individual is a new genotype. It is not hard to see 
that conditional on a given individual's branching rate A, the total number of offspring of that individual is 
k with probability 

1 / A \fe , 

fc = 0,l,.. 



l + AVl + A 

with mean A. Thus the unconditional mean number of offspring of the first individual is J Xfi{dX), and the 
total number of individuals that ever live in the evolutionary process is the same as the total progeny in 
a Gallon- Watson process with an offspring distribution which has this mean. The total progeny is infinite 
with positive probability if and only if this mean is larger than 1, so (II) is equivalent to positive probability 
of survival. 

Condition (I) corresponds to the prediction of the differential equation model (jl.ip . That is, below a 
certain threshold for the mutation probability the virus can survive because a well adapted (i.e. high A) fixed 
genotype can survive. However, if (I) fails it is still possible to have survival by (II). In this case survival 
holds because of a growing "cloud" of ever changing mutants of low replicative ability. 

Observe that for any e > and r in [0, 1) there are distributions /i for which (I) holds but / Xdfi{X) < e. 
This shows that the behavior of our evolution process is drastically different from the classical Gallon- Watson 
process in homogeneous or random environments. For these processes survival is possible if and only if the 
expected offspring (or a closely related expectation) is large enough (see Harris (1989) for homogeneous 
environments and Smith and Wilkinson (1969) for random environments). 

It is clear that if the support of fi is unbounded then (I) holds for all r < 1, so for interesting examples 
we consider distributions with compact support. Among these distributions a natural family to consider is 
the uniform distribution on [0, a], a > 0. As the following shows, this class exhibits all possible types of 
survival behavior depending on the exact values of a and r. 
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Corollary 1. Let ^ be the uniform distribution on [0,a], a > 0. If < a < 1 then the evolution process 
dies out a.s. for all r G [0,1], while if a > 2 the evolution process survives with positive probability for all 
r G [0, 1]. If a = 2 then the evolution process dies out a.s. for r = 1 and survives with positive probability for 
all r CI [0,1). If 1 < a < 2 then there exists rc G (1 — 1) such that 

(a) // r < 1 — i then (I) holds and the evolution process survives with positive probability. 

(b) If 1 — ^ < r < rc then (II) holds and the evolution process survives with positive probability. 

(c) If r > r^ then the evolution process dies out a.s. 



In words, whether the population goes extinct when the mutation rate is above a certain threshold 
depends crucially on the value of a. If a > 2 there is no such threshold: the population survives for any 
mutation probability r. Note also that for 1 < a < 2 there are two distinct thresholds: 1 — - and re- If 
r < 1 — i a well adapted genome may survive forever while if 1 — i < r < no fixed genome can survive 
forever. In this regime the population survives as a growing cloud of ever changing mutants. Finally, if 
r > Tc the population goes extinct. 



2 Proof of Theorem 1 

Proof of Theorem 1. Recall that we start with a single genotype 1 individual at time 0. Let Xt be the 
number of type 1 individuals alive at time t. Conditional on the initial branching rate A, Xt is a birth-death 
process with individual birth rate A(l — r) and death rate 1. In particular, it is well known (see Chapter 4 
of Karlin and Taylor (1975)) that it survives with positive probability if and only if A(l — r) > 1, and that 

(2.1) £;(Xt|A)=exp((A(l-r)-l)t). 

Integration of the condition A(l — r) > 1 with respect to /i gives 

P{Xt > 1 V <> 0) > iff ^i{{X : A(l - r) > 1}) > 0. 

Now let Yt be the number of different genotypes born up to time t that are offspring of genotype 1 
individuals. Then Yt t Y^ as t ^ oo, the total number of different genotypes ever produced by genotype 1 
individuals. Note that if r > then Yoo < oo if and only ii Xt = eventually. For ft, > it is easy to see 
that 

E{Yt+h - Yt\X,Xt) = XrhX{t)+o{h) as /i i 0, 

from which it follows that 

j^E{Yt\X) ^ XrE{Xt\X) 

and therefore, using (|2.ip . 

E(Yt\X) = rx[ E{Xs\X)ds=^rX [ exp((A(l - r) - l)s)ds. 
Jo Jo 

Integration with respect to the measure fi now yields 

E{Yt)^ / rXexp{{X{l-r)-l)s)dsdn{X). 

Jo Jo 

By the monotone convergence theorem, E{Yt) t E{Yoo) as t — !> oo. Letting m(r) = E{Yoo), it is easy to show 
using the above that 



m(r) = 



oo if ^({A : A(l -r) > 1}) > 

rl/{l-r) 

Jo l-X{l-r/ ^^^^ ifM{A:A(l-r)>l})=0. 
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We now define the tree of genotypes first introduced by Schinazi and Scfiweinsberg (2008) for a different 
model. Assume (I) does not fiold, and tfius Yoo < oo a.s. Each, vertex in the tree will be labeled by a positive 
integer. There will be a vertex labeled k if and only if an individual of genotype k is born at some time. We 
draw a directed edge from j to k if the first individual of genotype k to be born had an individual of genotype 
j as its parent. This construction gives a tree whose root is labeled 1 because all individuals are descendants 
of the individual of genotype 1 that is present at time zero. The tree of genotypes is a (discrete time) 
Galton- Watson tree with offspring distribution pk = P{Yoo = k). The mean of the offspring distribution is 
m{r), and hence, the tree of genotypes is infinite with positive probability if and only if m{r) > 1. 

To finish the proof, we claim that there are only two ways for the evolution process to survive: either a 
fixed genotype survives forever with positive probability ((I) holds), or the tree of genotypes is infinite with 
positive probability ((II) holds). It is clear that if either of these occur then the evolution process survives 
with positive probability. Suppose now that both (I) and (II) fail. Then with probability one each genotype 
that ever appears gives birth to only finitely many individuals and also the tree of types is finite a.s. This 
means that the total number of individuals that ever appear is finite. □ 

3 Proof of Corollary 1 

Let fj, be the uniform distribution on [0, a] where a > 0. Then (I) is equivalent to a(l — r) > 1. If a(l — r) < 1 
then 



(3.1) m(r) = i r dX. 



The case < a < 1. Here a(l — r) < 1 for all r e [0, 1], so (I) does not hold. Furthermore, the fact that 
a < 1 implies that the integrand in (jS.ip is an increasing function of r. Thus for all r £ [0, 1], 

m{r) < to(1) = a/2 < 1, 

and hence (II) also fails. For every r the evolution process dies out a.s. 



The case a > 1. A little calculus shows that 

r 1 r , ,^ 1 
1 — r a (1 — r) 



(3.2) ^(^)^________ln(l_a(l-r)), re(l--,l). 



To complete the proof of Corollary 1 we will need the following properties of m(r). 
(PI) m(r) is continuous on (1 — 1/a, 1], lim m(r) = oo and limTO(r) = a/2. 

ril-l/a rtl 

(P2) If a > 3/2 then m(r) is strictly decreasing on (1 — i, 1) 

(P3) If 1 < a < 3/2 then there exists G (1 — i, 1) such that ■m{r) is strictly decreasing on (1 — i, Tq) and 
strictly increasing on (r^, 1). 

The proof of (PI) is simple and we will omit it. The proofs of (P2) and (P3) require some work, so we will 
postpone them for now and complete the proof of Corollary 1 assuming (P2) and (P3) have been established. 
We consider three cases. 

(i) If a > 2 and r < 1, then by (P2) m(r) > m(l) = a/2 > 1, so (II) holds for all r G (1 - 1/a, 1). Also, 
m(l) = a/2 implies (II) holds for a > 2 but fails for a = 2. 

(ii) If 3/2 < a < 2 then by m{l) < 1, and hence by (PI) and (P2) there exists a unique Tc G (1 — 1/a, r^) 
such that m(rc) = 1. By (P2), (II) holds for r < rc but fails for r > re- 

(iii) If 1 < a < 3/2 then by (PI) and (P3) m{ra) < 1. It follows that there exists a unique rc G (1 — 1/a, Tq) 
such that m{rc) = 1, m(r) > 1 on (1 — l/a,rc) and m(r) < 1 on (rc, 1]. 
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The proof of Corollary 1 is now complete except for the proofs of (P2) and (P3). At this point it is 
convenient to change variables. If we define the function 

g{x) = 1 — X ^ — (x — x^)ln(l ), xe(a, oo), 

a X 

then ^ 

Moreover, m is increasing (decreasing) on the interval (ri,r2) iff g is increasing (decreasing) on the interval 
((1 — ri)^^, (1 — r2)^^). A little calculation gives the first three derivatives of 

g'{x) = -l~ ---{2x- 1) ln(l - -) 

^ ^ ' x-a ' ^ x' 

„, , a — 3ax + 2a;^ 2 a 

9 W = 7 yi 1 ) 

x{x — ay a X 

, + ax{2a — 3) 

9 [X) = 2? • 

x'^[x — a)'' 

With some additional calculation one can explicitly check that 

(3.3) limg'{x) = —oo, lim ^'(a;) = 0, 

(3.4) lim5"(a;) = +00, lim g"{x)=0. 

xia X— S- + CXO 

We also note that by (PI), 

(3.5) liiiig{x) = oo, lim g{x) = a/2. 

x^a X— >+oo 

Suppose a > 3/2. Then g"'{x) < for all x > a, and hence the function g" is strictly decreasing on 
(a, oo). In view of p.4p . g" must be positive on (a, +oo), which implies g' is strictly increasing on (a, +oo). 
In view of p.3p . g' must be negative on (a, +oo), which implies g is strictly decreasing on (a, +oo). This 
means that m{r) is strictly decreasing on (1 — 1/a, 1), so (P2) is proved. 

Finally, suppose that 1 < a < 3/2, and put 6 = a/(3 — 2a). Then b > a, g'" < on (a, 6) and g'" > 
on (6, oo). As a consequence, g" is strictly decreasing on (a, b) and strictly increasing on (6, oo). In view of 
()3.4p there must exist a unique c S (a, 5) such that g" > on (a, c) and g" < on (c, oo). This implies g' 
is strictly increasing on (a,c) and strictly decreasing on (c, oo). In view of p.3|) there must exist a unique 
Xa G (a, c) such that 5' < on (a, Xq) and > on {xa, 00). This implies g is strictly decreasing on (a, Xa) 
and strictly increasing on (a;a,oo). By setting = 1 — 1/xa and using the correspondence between the 
functions m and g we obtain (P3). 
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